METHOD AND SYSTEM FOR PROCESSING NETWORK DISCOVERY DATA 



BACKGROUND 



1. 



Technical Field 



5 



[0001] The present invention relates to discovering information concerning a 
computing network. More particularly, the invention concerns using state values 
corresponding with data aggregations to process network discovery data. 



2. 



Description of Related Art 

[0002] Modem computing systems often include a large number of computing 



10 



devices and storage devices. The computing devices and storage devices may be 
interconnected via a Storage Area Network (SAN), which generally permits the 
computing devices to access any of the storage devices that are connected to the SAN. 
[0003] A SAN provides an environment in which data storage devices are 

1 5 managed within a high speed network that is dedicated to data storage. Access to the 

SAN may be provided via one or more storage manager servers that handle data storage 
requests (for example, copy, backup, etc.) from data client nodes (data clients), for 
example via conventional LAN (Local Area Network) or WAN (Wide Area Network) 
connections. Storage manager servers are programmed data processing platforms that 

20 maintain interfaces to the client-side LANAVAN and to the storage-side devices that 
define the data storage network's pool of peripheral storage devices. These storage 
devices may include any number of interconnected magnetic disk drive arrays, optical 
disk drive arrays, magnetic or optical tape libraries, etc. Interconnectivity in a SAN may 
be provided with arbitrated loop arrangements, or more commonly, with switching 

25 fabrics. Typical interconnectivity components may include copper or fiber optic cables, 
hubs, bridges, gateways, switches, directors, and other data communication equipment 
designed for high speed data transfer between and among all of the interconnected 
storage manager servers and storage devices that may be coupled to (or that may be 
considered to be part of) the data storage network. 
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[0004] Management applications, for example SAN management applications, 
storage resource management applications, and/or storage management applications, are 
often employed to manage and configure SANs. Management applications may perform 
key functions such as discovering the components included in a SAN configuration, 
5 discovering attributes of the components, and discovering and mapping the physical and 
logical relationships between the components. As an example, component attributes 
could include physical and logical storage unit identification information and port 
information, and could also include file system information and operating system 
information. Generally, management applications may monitor network storage devices 
1 0 and their logical integration with storage manager servers, and may also monitor network 
interconnectivity components, and the storage manager servers. Management 
appUcations may perform these functions on a continual basis in order to manage the 
SAN configuration and/or to store and render current snapshots of the SAN 
configuration. 

1 5 [0005] The management applications may process SAN information that is 

received fi-om software agents that are located throughout the SAN. An agent is a logical 
entity that may reside on a network node, such as a storage manager server, a storage 
device, or a network interconnectivity component. An agent may be programmed to 
provide information about a portion of a data storage network to a supervising 

20 management application. Usually the information is obtained firom multiple agents, 

because a single agent typically will not have complete SAN configuration information. 

[0006] A network management server typically receives the information from 
the agents and attempts to create a coherent view of the network. Obtaining information 
from the agents may be called conducting a discovery poll. The term "discovery poll" 

25 refers to the discovery of data storage network information via a management 
application's agents, and the subsequent processing of that information by the 
management application. Discovery polls may be scheduled or on-demand, or may be 
triggered by asynchronous events. Obtaining and combining the information received 
from agents is generally resource intensive because discovery polls may occur firequently, 
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because a large number of agents may used to obtain complete information about a SAN 
configuration, because the information obtained firom the agents may be large and 
complex (depending on the size and complexity of the SAN configuration), and because 
the list of agents may grow and shrink dynamically. Due to the fact that new requests for 
5 discovery polls can arise while a previous discovery poll is being processed, discovery 
processing can become severely backlogged, and undesirable delays in fulfilling 
processing requests may occur. 

[0007] A network may include hundreds of network management servers, each 
of which has a discovery agent. With current methods for performing the discovery 

10 process, typically all of the discovery agents available in the network are used to retrieve 
and process information. This can result in large processing times, because the time 
required for obtaining and processing the information is proportional to the number of 
agents, and is also proportional to the size and complexity of the information gathered by 
the agents. Also, because all agents gather information even if the gathered information 

1 5 overlaps the information gathered by another agent, SAN performance may be adversely 
impacted due to the bandwidth utilized by the agents. Consequently, known techniques 
are generally inadequate for producing a current view of a network in a time efficient 
manner. 
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SUMMARY 

[0008] One aspect of the invention is a method for processing network discovery 
data. An example of the method includes defining a plurality of network data 
aggregations. This example also includes assigning a current state value to at least one of 
5 the data aggregations. This example further includes, for at least one current state value, 
determining if the current state value is different than a corresponding prior state value. 
This example also includes merging data corresponding with at least one data aggregation 
determined to have a current state value that is different than a corresponding prior state 
value, with prior data corresponding with at least one data aggregation determined to 

1 0 have a current state value that is not different than a corresponding prior state value. 

[0009] Other aspects of the invention are described in the sections below, and 
include, for example, a computing system, and a signal bearing medium tangibly 
embodying a program of machine-readable instructions executable by a digital processing 
apparatus to perform operations for processing network discovery data. 

15 [001 0] Some examples of the invention advantageously permit monitoring 

network discovery data received fi-om a number of sources, in a time efficient manner. 
Additionally, some examples of the invention have the capability to quickly determine 
when there has been no change in a system. Also, some examples of the invention reduce 
transmissions between agent services and a management client. Further, some examples 

20 of the invention permit analysis to be limited to the locality of changes in a network. 

Further, some examples of the invention are easily scalable to different sized networks. 
The invention also provides a number of other advantages and benefits, which should be 
apparent fi-om the following description. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
[001 1] FIG. 1 is a block diagram of the hardware components and 
interconnections of a computing system in accordance with an example of the invention. 
[0012] FIG. 2 is a block diagram of the hardware components and 
5 interconnections of a computing apparatus in accordance with an example of the 
invention. 

[0013] FIG. 3 is an example of a signal-bearing medium in accordance an 
example of the invention. 

[0014] FIGS. 4A-4B are a flowchart of an operational sequence for processing 
1 0 network discovery data in accordance with an example of the invention. 

[0015] FIG. 5 is a representation of a hierarchal order of data aggregations in 
accordance with an example of the invention. 
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DETAILED DESCRIPTION 
[0016] The nature, objectives, and advantages of the invention will become 
more apparent to those skilled in the art after considering the following detailed 
description in connection with the accompanying drawings. 

1. HARDWARE COMPONENTS AND INTERCONNECTIONS 

[0017] One aspect of the invention is a computing system for processing 
network discovery data. As an example, the computing system may be embodied by all, 
or portions of, the computing system 100 shown in FIG. 1. The computing system 100 
may include one or more clients 102, 104, one or more storage manager servers 106, 108 
(also called managed hosts), one or more network management servers 109 (also called a 
management client), a plurality of storage devices 1 10, 1 12, 1 14, a LAN 1 16, and a SAN 
118. 

[0018] The storage manager servers 106, 108 may communicate with the chents 
102, 104 via the LAN 1 16, or alternatively via a WAN. Data associated with the chents 
102, 104 may reside on the storage devices 1 10, 1 12, 1 14 that are coupled to the SAN 
118. As an example, the storage devices 110, 1 12, 1 14 may include one or more 
magnetic storage devices 110 (such as magnetic disk drives), optical storage devices 1 12 
(such as optical storage drives), and/or tape libraries 1 14, and any other suitable types of ^ 
storage devices. Generally, a large number of storage devices may be coupled to the 
SAN 118. The SAN 118 may also include connection components such as switches 132, 
134, gateway 136, and directors, hubs, links, etc. As an example switches 132, 134 may 
be IBM® model 2109 switches. 

[0019] The SAN 1 1 8 may be implemented with Fibre Channel, iSCSI, or any 
other suitable network technology or protocol. There are a variety of computer hardware 
and software components that may be used to implement the various elements that make 
up the SAN 118. Commonly, a Fibre Channel architecture implemented using copper or 
fiber optical media will be used to provide the physical and low level protocol layers. 
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Higher level protocols, such SCSI-FCP (Small Computer System Interface-Fibre Chamiel 
Protocol), IPI (Intelligent Peripheral Interface), IP (Intemet Protocol), FICON (Fiber 
Optic CONnection), etc., may be mapped onto the Fibre Channel protocol stack. The 
Fibre Channel architecture generally will dictate the choice of devices that may be used 
5 to implement the interconnection components of the SAN 1 1 8, as well as the network 
interface adaptors and controllers that connect the storage manager servers 106, 108, the 
network management server 109, and storage devices 1 10, 1 12, 1 14 to the SAN 1 18. In 
alternative embodiments, a low level network protocol, such as Ethemet, could be used to 
implement the SAN 118, In some alternative embodiments, a network other than a SAN 

1 0 could be utilized in place of the SAN 1 18. 

[0020] The storage manager servers 106, 108 may be configured as SAN 
application servers offering SAN access interfaces to the clients 102, 104. As an 
example, the storage manager servers 106, 108 and/or the network management server 
109 could be implemented with IBM® zSeries®, iSeries™, pSeries™ or xSeries® system 

1 5 products, each of which provides a hardware and operating system platform set. Each 
storage manager server 106, 108 may be programmed with higher level SAN server 
application software, such as the IBM® TIVOLI® Storage Manager system. Generally, 
the clients 102, 104, the storage manager servers 106, 108, and/or the network 
management server 109 could each be implemented, for example, with a computer 

20 running the AIX operating system, a Windows 2000 server , or generally any Intel based 
PC server system. In one specific example, the network management server 109 could be 
implemented with an IBM® xSeries model 330. 

[0021] The storage devices 1 10, 1 12, 1 14 may be implemented using any of a 
variety of data storage device products. As an example the storage devices 1 10, 1 12, 1 14 

25 could include an IBM® TotalStorage'^'^ ENTERPRISE STORAGE SERVER® (ESS) 
System, an IBM® FAStT 900 disk storage system, an IBM® proSCSI JBOD system, 
and/or an IBM® TotalStorage™ Enterprise Tape System. Generally, any suitable storage 
devices could be coupled to the SAN 1 18, and any suitable computing devices could be 
used to implement the chents 102, 104, the storage manager servers 106, 108, and the 
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network management server 109. 

[0022] The SAN 118 interconnection components may include switches 132, 
134, and gateway 136, and any number of directors, hubs, bridges, routers, additional 
switches and gateways, etc., which are generally widely available from a variety of 
vendors. As an example, the links interconnecting these components may be constructed 
using copper wiring or single-mode or multi-mode optical fiber. 

[0023] As mentioned above, data storage area networks typically employ 
standardized discovery entities, referred to as "agents," (which may be called discovery 
agents or discovery agent services), that assist high level management applications 120 
(also called network managers), obtain data storage network information. The 
management apphcations 120 may run on the network management server 109 (also 
called the management client 109). An agent may generally be thought of as an 
information gathering entity within a computing network, which has the ability to gather 
network information within a sphere of discovery capability. An inband agent 122 may 
be installed on storage manager server 106, and similarly, an inband agent 124 may be 
installed on storage manager server 108. Outband agents 126, 128 may run on the 
network management server 109. Additional agents may also be installed on storage 
devices 110, 112, 114, or on other devices, attachedto the SAN 118. As an example, the 
network management applications 120 and the outband agents 126, 128 that operate on 
the network management server 109 may be implemented using any of a variety of 
conventional network management software products, for example the IBM® TIVOLI® 
Storage Area Network Manager. One or both of the storage manager servers 106, 108 
may be managed hosts which are managed by the management applications 120 on the 
network management server 109. 

[0024] The management applications 120 may communicate with the inband 
agents 122, 124 via a via a communication network, such as the IP network that has IP 
network segments 130a and 130b, which may be separate from the communication 
pathways of the data storage network being managed (for example SAN 118). The 
outband agents 126, 128 may conduct discovery by issuing queries to the switches 132, 
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134 in the SAN 118, via the IP network segment 130a. In contrast, the inband agents 
122, 124 may conduct discovery by issuing queries via the communication pathways 
(network links) of the SAN 118. 

[0025] As an example, a Fibre Channel switch, may have sixteen Fibre Channel 
5 ports, and one Ethernet port (a management port), which may be coupled to a WAN or 
LAN, and which may be used for querying the switch. Inband agents 122, 124 must use 
the Fibre Channel ports for querying the switch. 

[0026] A variety of inband and outband discovery protocols have been 
developed for obtaining information about data storage network topology and component 

1 0 attributes. With respect to inband component attribute discovery, devices that may have 
SCSI interfaces, such as the storage devices 1 10, 1 12, 1 14, and the gateway 136, may be 
polled by the inband agents 122, 124 using SCSI queries to obtain device attribute 
information, including physical and logical storage unit identification information, port 
information, and the like. The inband agents 122, 124 may also perform self-discovery to 

1 5 obtain attribute information about the storage manager servers 106, 108, such as file 

system information and operating system information. As an example, inband topology 
queries may be performed by the inband agents 122, 124 using the Fibre Channel GS-3 
(Generic Services) protocol (FC-GS-3) and the Fibre Channel FS (Framing and 
Signaling) protocol (FC-FS), to query the switches 132, 136 and to obtain fabric 

20 configuration and end node information. In addition to being responsive to queries from 
the inband agents 122, 124, all devices implementing SCSI interfaces, HBA (Hardware 
Bus Adapter) drivers, and the FC-GS-3 protocol, will generally support the reporting of 
device and fabric events as they occur in the data storage network. With regard to 
outband discovery, a widely used protocol is SNMP (Simple Network Management 

25 Protocol). 

[0027] The outband agents 126, 128 may be configured to use management 
ports to issue SNMP queries to the switches 132, 134, as well as to receive port topology 
information and fabric events as they occur in the SAN 118. To that end, the switches 
132, 134 may implement SNMP agents (not shown) that interact with the outband agents 
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126, 128. The Network Management Server 109 may include a database (not shown) for 
storing discovery information. The management applications 120 and outband agents 
126, 128 and associated hardware on the network management server 109 may be called 
a manager appliance. 

5 [0028] The inband agents 122, 124 may include sets of scanners (not shown) 

that are programmed to obtain attribute and topology information using different 
discovery protocols, such as the SCSI protocol, the FC-GS-3 protocol, and the FC-FS 
protocol. Additional scanners, such as ED/FI (Error Detection/Fauh Isolation) statistics 
collection scanners, and CIM (Common Information Model) subsystem scanners, may 

10 also be included. The outband agents 126, 128 may include conventional SNMP support 
logic, which may include logic for gathering information described by a MEB 
(Management Information Base), logic for responding to queries, and logic for reporting 
network events (traps) to the management component. The SNMP support logic may be 
used to implement one or more outband scanners, such as a topology scanner and an 

1 5 ED/FI scanner. Additional scanners could also be provided for implementing proprietary 
APIs, such as the BROCADE® API (Application Programming Interface) for querying 
BROCADE® switches. 

[0029] An exemplary computing apparatus 200 is shown in FIG, 2. As an 
example, the cHents 102, 104, the storage manager servers 106, 108, the network 

20 management server 109, and any other computing devices in the computing system 100 
could be implemented with an example of the computing apparatus 200. The computing 
apparatus 200 includes a processor 202 (which may be called a processing device), and in 
some examples could have more than one processor 202. As an example, the processor 
may be a PowerPC RISC processor, available from International Business Machines 

25 Corporation, or a processor manufactured by Intel Corporation. The processor 202 may 
run any suitable operating system, for example, Windows 2000, AIX, Solaris™, Linux, 
UNIX, or HP-UX™. The computing apparatus 200 may be implemented on any suitable 
computing device, for example a personal computer, a workstation, a mainframe 
computer, or a supercomputer. The computing apparatus 200 also includes a storage 204, 
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a network interface 206, and an input/output 208, which are all coupled to the processor 
202. The storage 204 may include a primary memory 210, which for example, may be 
RAM, and a non volatile memory 212. The non- volatile memory 212 could be, for 
example, a hard disk drive, a drive for reading and writing from optical or magneto- 
5 optical media, a tape drive, non-volatile RAM (NVRAM), or any other suitable type of 
storage. The storage 204 may be used to store data and application programs and/or other 
programming instructions executed by the processor. The application programs could 
generally be any suitable applications. The network interface 206 may provide access to 
any suitable wired or wireless network. 

10 

II. OPERATION 

[0030] In addition to the hardware embodiments described above, other aspects 
of the invention concem a method for processing network discovery data. 

15 A. Signal-Bearing Media 

[0031] In the context of FIGS. 1 and 2, the method aspects of the invention may 
be implemented, for example, by having the network management server 109, and/or the 
storage manager servers 106, 108, and in some examples, also one or more of the agents, 
execute a sequence of machine-readable instructions, which can also be referred to as 

20 code. These instructions may reside in various types of signal-bearing media. In this 
respect, some aspects of the present invention concem a programmed product, 
comprising a signal-bearing medium or signal-bearing media tangibly embodying a 
program of machine-readable instructions executable by a digital processing apparatus to 
perform operations for processing network discovery data. 

25 [0032] This signal-bearing medium may comprise, for example, primary 

memory 210 and/or non-volatile memory 212. Altematively, the instructions may be 
embodied in a signal-bearing medium such as the optical data storage disc 300 shown in 
FIG. 3. The optical disc can be any type of signal bearing disc or disk, for example, a 
CD-ROM, CD-R, CD-RW, WORM, DVD-R, DVD+R, DVD-RW, or DVD+RW. 
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Additionally, whether contained in the computing system 100, or elsewhere, the 
instructions may be stored on any of a variety of machine-readable data storage mediums 
or media, which may include, for example, a "hard drive", a RAID array, a magnetic data 
storage diskette (such as a floppy disk), magnetic tape, digital optical tape, RAM, ROM, 
5 EPROM, EEPROM, flash memory, programmable logic, any other type of firmware, 
magneto-optical storage, paper punch cards, or any other suitable signal-bearing media 
including transmission media such as digital and/or analog communications links, which 
may be electrical, optical, and/or wireless. For example, in some embodiments the 
instructions or code may be accessible fi-om a file server over a network, or fi-om other 

1 0 transmission media, and the signal bearing media embodying the instructions or code 
may comprise a transmission media, such as a network transmission line, wireless 
transmission media, signals propagating through space, radio waves, and/or infirared 
signals. Additionally, the signal bearing media may be implemented in hardware logic, 
for example, an integrated circuit chip, a Programmable Gate Array (PGA), an 

1 5 Application Specific Integrated Circuit (ASIC). As an example, the machine-readable 
instructions may comprise software object code, compiled fi-om a language such as 

"C-H-". 

B. Overall Sequence of Operation 
20 [0033] Techniques for discovering network information fi-om discovery 

agents are described in United States Patent Application No. 10/666,046, filed 
September 20, 2003, titled "Intelligent Discovery of Network Information fi-om 
Multiple Information Gathering Agents" which is incorporated herein by reference. 

[0034] For ease of explanation, but without any intended limitation, exemplary 
25 method aspects of the invention are described with reference to the computing system 
100 described above and shown in FIG. 1. An example of the method aspect of the 
present invention is illustrated in FIGS. 4A-B, which show a sequence 400 for a method 
for processing network discovery data. 
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[0035] The operations of the sequence 400 may be performed the network 
management server 109, or by the network management server 109 in conjunction with 
one or more discovery agents, which may be located on one or more of the storage 
manager servers 106, 108, or elsewhere in the computing system 100. Alternatively, the 
5 operations of the sequence could be performed by one or more of the storage manager 
servers 106, 108, or by the network management server 109 and one or more of the 
storage manager servers 106, 108, and in some embodiments may also be performed in 
conjunction with other agents and components in the computing system 100. Agents 
outside of the network management server 109 are not required for practicing some 

10 examples of the invention, because some examples of the invention could run completely 
on the management server 109. 

[0036] Prior to operation of the sequence 400, typically, the system boundaries 
will be known, the agents and the capabilities of the agents will be known, and the 
desired discovery services will be identified. Sequence 400 may include, and may begin 

1 5 with, operation 40 1 , which comprises initially gathering a complete set of data. The 

complete set of data may be gathered by the set of designated SAN discovery agents for 
the system. The data may include, for example, zoning information, topology 
information, and/or device attributes information, etc. Data collection by the agent 
services may involve either polling devices or receiving notifications fi-om devices that 

20 give information about the SAN. 

[0037] The sequence 400 may also include operation 402, which comprises 
defining a plurality of network data aggregations (also called sections), which may also 
be characterized as defining data aggregation boundaries or as compiling, organizing, or 
dividing data into aggregations. Data aggregations generally are chosen so that data 

25 aggregations may be easily removed and merged when generating a view of a network. 
As an example, the plurality of network data aggregations may be defined based on 
zoning information including zone and zone set (which are logical entities). A zone may 
be defined as a set of N ports in a fabric that make up a logical boundary. Defining data 
aggregations by zone provides granularity. In another example, the plurality of network 
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data aggregations may be defined based on topology information, for example SAN and 
domain address (for example by using domain IDs associated with switches). Using 
topology information may be particularly useful for large SANs. In another example, the 
plurality of network data aggregations may be defined based on device attributes 
5 information, for example device and port, where similar types of devices, or product 
models could be grouped together. In other examples the data aggregations could be 
based on fabric. A fabric may be defined as a coherent interconnection of switch links 
having a common address space and a common protocol Different fabrics may be 
separated by a bridge or a router. For example, a Fibre Channel fabric and an iSCSI 

1 0 fabric could be separated with a gateway. In other examples, data aggregations could be 
based on switch boundaries. If switch boundaries are used, switches with high adjacency 
(switches that have the most interswitch links between them) may be grouped, and the 
boundaries may be defined at links chosen to minimize the number of links at the 
boundaries. If switch boundaries are used, interswitch Hnks may be used to identify data 

1 5 aggregations. Using switch boundaries provides granularity. Switch sets may be 

identified by World Wide names. In other examples the data aggregations could be based 
on virtualization layers, or could be based on SAN boundaries. In some examples a 
management client 109 could send a request that specifies aggregation policy, and may 
be able to optimize the boundaries. Generally, agent services collect the data and in some 

20 examples may organize the data. In some altemative embodiments, defining data 

aggregations permits providing event notifications, or notification of changes, that occur 
in one or more data aggregations. 

[00381 The following is an example of data aggregation boundaries, in an XML 
file using DTD format, prior to the assignment of state values. More specifically, the 

25 following is an example of a DTD (fi-om the IBM® Tivoli® SAN Manager agent design), 
which aggregates by SAN element type using an XML type format. (An example of a 
DTD output file with the data aggregations included, is discussed fiirther below.) In 
order to find a description of some type of device in the SAN desired to be seen, the 
appropriate code word may be searched for in the DTD. Fox example, to find a particular 
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zone set, the teim "ZoneSetldXml" could be searched. SanXml shows what switches are 
in ttie SAN, and ID is the World Wide Name of a primary switch. 
InterconnectElementXml displays the ports connected to each switch. ZoneXml displays 
what is in each zone. This first exemplary DTD output file is as follows: 
<?xml version="1.0?> 



DOCTYPE TopologyXml }> > 

ELEMENT ZoneMemberXml (UniqueldXml, ParameterXml*)> 

ELEMENT PlatformXml (UniqueldXml, ParameterXml*, FcNodeIdXml*)> 

ELEMENT SystemXml (UniqueldXml, ParameterXml*, ControllerIdXml*)> 

ELEMENT ControUerXml (UniqueldXml, ParameterXml*, FcNodeIdXml*)> 

ELEMENT FcPortXml (UniqueldXml, ParameterXml*, AttachedFcPortIdXml*)> 

ELEMENT FcNodeXml (UniqueldXml, ParameterXml*, FcPortIdXml*)> 

ELEMENT ParameterXml (NameXml, ValueXml)> 

ELEMENT UniqueldXml (#PCDATA)> 

ELEMENT FcPortXml (#PCDATA)> 

ELEMENT InterconnectElementldXml (#PCDATA)> 

ELEMENT ZoneSetldXml (#PCDATA)> 

ELEMENT ZoneldXml (#PCDATA)> 

ELEMENT FcNodeldXml (#PCDATA)> 

ELEMENT AttachedFcPortldXml (#PCDATA)> 

ELEMENT ControUerldXml (#PCDATA)> 

ELEMENT NameXml (#PCDATA)> 

ELEMENT ValueXml (#PCDATA)>}x!-End Topology.dtd->]> 



25 



[0039] The sequence 400 may further include operation 404, which comprises 
gathering data (collecting data) for each data aggregation prior to performing operation 
406 (described below). The data may be gathered by polling agents, or by receiving 
notifications from agents to gather data. 
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[0040] The sequence 400 may further include operation 406, which comprises 
assigning an initial state value for each data aggregation. Operation 406 may also include 
retaining the initial state values, and may further include retaining the corresponding data. 
The operation 406 of assigning an initial state value for each data aggregation may also 
5 include organizing the data in the data aggregations in a prescribed order (as discussed 
below with regard to operation 408). All of the information registered for by the 
management cUent 109, including the initial state data, may be transmitted from agent 
discovery services to the management client 109, so the management cUent 109 may 
construct an initial view (also called a graph) of the network. Alternatively, the 

10 management cUent 109 may assign the initial state values. State values are discussed 
further below with regard to operation 410. In some examples, CIM data could be 
packaged and sent via a distributed communication service such as http. 

[0041] Operations 401-406, described above, comprise an initialization phase of 
sequence 400 which need not be repeated. After operation 406 has been performed, 

1 5 initialization is complete, and the monitoring phase, described in operations 407-41 8 

below, may commence. One or more of the operations of the monitoring phase may be 
repeated as many times as desired. The advantages provided by some embodiments of 
the invention are most evident in the monitoring phase. 

[0042] Operation 407 comprises gathering data for each data aggregation to 

20 which a current state value is to be assigned, prior to performing operations 408 and 410 
(described below). Data need not be gathered for each data aggregation. For example, 
data could be gathered only ne^" a change. In a specific example, data could be gathered 
in a data aggregation defined by a fabric boundary (which, for example, could comprise 
four switches.) As in operation 404, the data may be gathered by polling agents, or by 

25 receiving notifications from agents to gather data. 

[0043] The sequence 400 may further include operation 408, which comprises, 
prior to the operation 410 of assigning a current state value to at least one of the data 
aggregations, organizing data in the at least one of the data aggregations in a prescribed 
order. Because the data is organized into a prescribed order, when the data is traversed to 
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determine the current state value of the data aggregation, the data is traversed in a 
particular order. The data could be placed in a canonical or normalized form, for 
example, by using any of the following techniques: performing a simple sort (for example 
by Fibre Chaimel World Wide Name), hashing, using B+, using a binary tree, or by 
5 using a physical or logical graph. In an altemative embodiment the data could be sorted 
into table rows and columns (for example for use with SQL). In another altemative 
embodiment XML could be used to organize data and network views. Rather than 
placing the data in a prescribed order, in another altemative embodiment, the operation 
410 (discussed below) of assigning a current state value to at least one of the data 
1 0 aggregations may comprise processing data in the at least one of the data aggregations in 
a prescribed order. 

[0044] The sequence 400 may further include operation 410, which comprises 
assigning a current state value to at least one of the data aggregations. Operation 410 
may also include retaining the current state values, and may further include retaining the 

1 5 corresponding data. The operation of assigning a current state value to at least one of the 
data aggregations may be performed by at least one agent discovery service, or 
alternatively, may be performed by a management client 109. As an example, each 
current state value may be a CRC (Cyclical Redundancy Code) value computed utilizing 
data associated with a corresponding data aggregation and a CRC polynomial. For 

20 example, a 32 bit CRC polynomial employed in Fibre Channel protocol could be utilized, 
wherein the polynomial has the following form: 

x^Vx^ W^+x^^+x^^+x^^+x' V+xV+xV+x+1. In another example, each 
current state value may be a checksum computed against data associated with a 
corresponding data aggregation. Any method for state assignment that describes the state 
25 of the aggregation could be used. The first DTD example above does not provide a place 
in the format for a state variable. The following DTD is modified to show a place for a 
state element for each aggregation type. The XML file is broken down by management 
data field type in the DTD file, which shows a summary of the XML file. This second 
exemplary DTD output file is as follows: 
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<?xmlversion="1.0?> 

<!DOCTYPE TopologyXml [<!- Topology.dtd -> 

<!ELEMENT TopologyXml (HeaderXml, SanXml*, InterconnectElementXml*, 
ZoneXml*, PlatfomiXml*, SystemXml*, ControUerXml*, FcPortXml*, FcNodeXml*, 
ZoneSetXml*, ZoneMemberXinl*)> 
<!ELEMENT HeaderXml (ParameterXml*)> 

<!ELEMENT SanXml (UniqueldXml, ParameterXml*, IntercomiectElementldXml*, 
FcPortldXml*, ZoneSefldXml*)> 
<!ELEMENTSanAggregationStateXml(UniqueIdXml)> 
<!ELEMENT InterconnectElementXml (UniqueldXml, ParameterXml*, 
FcPortIdXml*)> 

ELEMENT InterconnectelementAggregationStateXml (UniqueIdXml)> 
ELEMENT ZoneSetXml (UniqueldXml, ParameterXml*, ZoneIdXml*)> 
ELEMENT ZoneSetAggregationStateXml (UmqueIdXml)> 
ELEMENT ZoneXml (UniqueldXml, ParameterXml*, FcPortIdXml*)> 
ELEMENT ZoneAggregationStateXml (UniqueIdXml)> 
ELEMENT ZoneMemberXml (UniqueldXml, ParameterXml*)> 
ELEMENT ZoneMemberAggregationStateXml (UniqueIdXml)> 
ELEMENT PlatformXml (UniqueldXml, ParameterXml*, FcNodeIdXml*)> 
ELEMENT PlatformAggregationStateXml (UniqueIdXml)> 
ELEMENT SystemXml (UniqueldXml, ParameterXml*, ControllerIdXml*)> 
ELEMENT SystemAggregationStateXml (UniqueIdXml)> 
ELEMENT ControUerXml (UniqueldXml, ParameterXml*, FcNodeIdXml*)> 
ELEMENT ControllerAggregationStateXml (UniqueIdXml)> 
ELEMENT FcPortXml (UniqueldXml, ParameterXml*, AttachedFcPortIdXml*)> 
ELEMENT FcPortAggregationStateXml (UniqueIdXml)> 
ELEMENT FcNodeXml (UniqueldXml, ParameterXml*, FcPortIdXml*)> 
ELEMENT FcNodeAggregationStateXml (UniqueIdXml)> 
ELEMENT ParameterXml (NameXml, ValueXml)> 
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<!ELEMENT UniqueldXml (#PCDATA)> 
<!ELEMENT FcPortXml (#PCDATA)> 
<!ELEMENT InterconnectElementldXml (#PCDATA)> 
<!ELEMENT ZoneSetldXml (#PCDATA)> 
5 <!ELEMENT ZoneldXml (#PCDATA)> 

<!ELEMENT FcNodeldXml (#PCDATA)> 
<!ELEMENT AttachedFcPortldXml (#PCDATA)> 
<!ELEMENT ControUerldXml (#PCDATA)> 
<!ELEMENT NameXml (#PCDATA)> 
1 0 <!ELEMENT ValueXml (#PCDATA)>}><!.-End Topology.dtd-->]> 



[0045] Referring to FIG. 4B, the sequence 400 may further include operation 
412, which comprises, for at least one (and in some examples, for each) current state 
value, determining if the current state value is different than a corresponding prior state 

1 5 value, due to a change in the data. As an example, the corresponding prior state value 
may be the most recent state value before the current state value for a data aggregation. 
Changes in the network cause the data to change. The following are some examples of 
changes that may take place in a network: switches, gateways, routers, storage devices, or 
other components may be added or removed, or links may be degraded or improved. 

20 [0046] If desired, the comparison of current and prior state values may be 

conducted for data aggregations in a hierarchal fashion, for example using the hierarchy 
500 shown in FIG. 5. In this case the analysis may begin with larger data aggregations, 
and may be continued down to smaller data aggregations in a hierarchal order, wherein 
the smaller data aggregations (which could be as small as a single switch) may be subsets 

25 of larger data aggregations (which could be as large as an entire network). With this 
technique, instead of initially conducting time consuming low level comparisons, the 
management server 109 may start at a top level with a large data aggregation, which may 
be considered to be a compilation of subset data aggregations, and then may analyze 
subsets only if there is a state change in the superset. Thus, some examples of the 
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invention provide for hierarchal analysis at various granularities, and may be scaled to 
large environments. Conducting the analysis in a hierarchal order permits speeding up 
the processing by eliminating data aggregations from further analysis if they have no 
changes. For example, operations 407 to 412 may first be conducted at a top level 501 of 
5 the hierarchy 500 for a data aggregation 502 that includes an entire system. If in 

operation 412 it is determined that the current state value for the data aggregation 502 
that includes the entire system is the same as a corresponding prior state value, then the 
analysis may be efficiently ended at this point (and, if desired, the sequence 400 
beginning at operation 407 may subsequently be repeated for the data aggregation 502 

1 0 that includes the entire system). Thus, the analysis may advantageously be quickly 
completed when it is determined that the current state value is the same as a 
corresponding prior state value. This procedure is particularly useful in examples where 
a network management client 109 periodically polls agents to receive data. 

[0047] If the current state value for the data aggregation 502 that includes the 

1 5 entire system is different than the corresponding prior state variable, then operation 413 
(shown in FIG. 4B) may be performed, which comprises determining if a hierarchal 
analysis is to be conducted. If it is determined that a hierarchal analysis is not to be 
conducted, then one or more of operations 416, 417, and 418 may be performed, as 
discussed below. If in operation 413 it is determined that hierarchal analysis is to be 

20 conducted, then operation 414 (shown in FIG. 4B) may be performed, which comprises 
determining whether there is a lower level in the hierarchy that has not been processed. 
If in operation 414 it is determined that there is a level in the hierarchy that has not been 
processed (for which current and prior state values have not been compared), then 
operation 415 may be performed, which comprises checking state values for one or more 

25 data aggregations in the next level of the hierarchy that has not been processed. For 

example, a current state value may be compared to a prior state value for data aggregation 
504a, and a current state value may be compared to a prior state value for data 
aggregation 504b, for the middle level 505 of the hierarchy 500 (shown in FIG. 5). 
Performing this analysis at the middle level 505 of the hierarchy 500 permits identifying 
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the location of a change in the network with more accuracy, by identifying whether a 
change has occurred in the data aggregation 504a corresponding with the first fabric, or 
in the data aggregation 504b corresponding with the second fabric. (Although possible, it 
is unlikely that changes will occur in both fabrics in the time during the analysis.) After a 
5 fabric is identified in which the current state value is different than a corresponding prior 
state value, then operation 414, which comprises determining whether there is a lower 
level in the hierarchy that has not been processed, may be performed again. If in 
operation 414 it is determined that there is a level in the hierarchy that has not been 
processed, then operation 415 may be performed again. For example, if it was previously 

1 0 determined that a change occurred in the first fabric, then in operation 41 5, a current state 
value may be compared to a prior state value for data aggregation 506a, and a current 
state value may be compared to a prior state value for data aggregation 506b, in the 
bottom level 508 of the hierarchy 500 (shown in FIG. 5). On the other hand, if it was 
previously determined that a change occurred in the second fabric, then operation 415 

1 5 may be performed with state values for data aggregations 506c and 506d. Performing 

operation 415 with state values corresponding with data aggregations at the bottom level 
508 of the hierarchy 500 permits defining an aggregation of switches in which the change 
has occurred, without having to perform the analysis on all of the aggregations of 
switches in the bottom level 508 of the hierarchy 500. After an aggregation of switches 

20 is identified in which the current state value is different than a corresponding prior state 
value, then operation 414, which comprises determining whether there is a lower level in 
the hierarchy that has not been processed, may be performed again. If there is no lower 
level in the hierarchy that has not been processed, then one or more of operations 416, 
417, and 418 may be performed, as discussed below, starting with performing operation 

25 416 for the aggregation of switches for which the current state value is different than a 
prior state value. 

10048] Referring again to FIG. 4B, if in operation 413 it is determined that 
hierarchal processing is not to take place, or if in operation 414 it is determined that there 
are no levels in the hierarchy that have not been processed, then the sequence 400 may 
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further include one or more of operations 416, 417, and 418. Operation 416 comprises 
receiving data corresponding with at least one (and in some examples, corresponding 
with each) data aggregation that has a current state value that is different than a 
corresponding prior state value, for example, as identified in operation 412, or, if 
5 hierarchal processing is performed, as identified in the most recent prior performance of 
operation 415. Thus, data corresponding with data aggregations that have changes may 
be sent to the management client 109, and data corresponding v/ith unchanged data 
aggregations does not have to be sent to the management client 109. In some examples, 
data corresponding with at least one (and in some examples, corresponding with each) 

1 0 data aggregation that has a current state value that is not different than a corresponding 
prior state value could also be received by the management chent 109. The data may be 
received by the management client 109 firom agent discovery services. The management 
cUent 109 could request the data, or the data could automatically be transmitted from 
discovery agents. As an example, if a zone obtains an additional entry, with reference to 

1 5 the exemplary DTD files above, the data that would be sent to the management chent 109 
would only be the ZoneMemberXml. In an altemative example, all data could be sent to 
the management client 109. A release mechanism may be included, for discovery agents 
to erase the data transmitted to the management client 109 the after the management 
client 109 has received the data. 

20 [0049 J Operation 417 comprises requesting polling on data aggregations that are 

subsets of a corresponding superset data aggregation that have a current state value that is 
different than a corresponding prior state value. The management cUent 109 may request 
the polling. Polling may be requested because not all aspects of a SAN are equally 
important or interesting to a management service. This capability provides an agent 

25 service that allows registration not on just a type of event, but on a defined aggregation. 
This permits greater flexibility in management fiinctions, and eliminates data reporting 
and analysis that is uninteresting to the manager. 

[0050] Operation 418 comprises merging at least some data corresponding with 
at least one (and in some examples, corresponding with each) data aggregation 
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determined to have a current state value that is different than a corresponding prior state 
value, with prior data corresponding with at least one data aggregation determined to 
have a current state value that is not different than a corresponding prior state value. If 
the boundaries are well chosen, new data in data aggregations where there is no change 
5 can be ignored. In data aggregations where the state has changed, the old data may be 

discarded and replaced with new data. The new data for the changed data aggregations is 
appended to old data for data aggregations that have not changed. Thus, the management 
client 109 merges the data to reconcile a new system representation of the management 
data. For example, in the case of the second exemplary DTD output file discussed above 

1 0 wherein the data is stored in an XML file, the file format provides clear data aggregation 
boundaries that can be easily managed. For example, the XML starting and ending tags 
associated with a data aggregation, and standard file manipulation methods, can be used 
to identify, remove, and replace data in a changed data aggregation with data that 
represents the current state of the system. 

15 [0051] After operation 412 or operation 418, the monitoring phase of sequence 

400 may be repeated, beginning with operation 407. Thus, one or more of operations 407 
to 418 may be repeated. 

[0052] Some examples of the invention may be described as intelhgent 
discovery agent services for minimizing client analysis. Some examples may be 

20 implemented as a method and computing system for performing intelligent discovery of a 
SAN by organizing the data reported by SAN discovery agents into states corresponding 
with data aggregations, and using the state information to minimize transmissions 
between agent services and a management client 109 requesting discovery services. In 
accordance with some examples of the invention, one or more SAN management clients 

25 109 compare current state information with prior state information, without having to 
perform analysis steps where state data has not changed, thereby leveraging SAN 
discovery agent aggregation and minimizing analysis time. 



m. OTHER EMBODIMENTS 
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[0053] While the foregoing disclosure shows a number of illustrative 
embodiments of the invention, it will be apparent to those skilled in the art that various 
changes and modifications can be made herein without departing from the scope of the 
invention as defined by the appended claims. Furthermore, although elements of the 
invention may be described or claimed in the singular, the plural is contemplated unless 
limitation to the singular is expUcitly stated. 
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